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Table 4: Complexity boundary involving scheduling problems and the makespan and mean flow time 
objective functions. 

3.3 Complexity Boundary Analysis 

In this section we have examined unit execution time problems involving non-chain structured tasks 
and constant distance constraints. Although allowing more general precedence constraints than sim- 
ple chains does, in general, make the scheduling problems more difficult, the more general precedence 
constraints do not necessarily make the problem W"'P-complete as we saw in Section 3.1. 

Table 4 presents the known boundary for unit execution time scheduling problems involving 
various precedence constraints with respect to the makespan and mean flow time objective functions. 
Note that the results obtained in this section are either minimal W"'P-complete results or maximal 
polynomial-time solvable results. Figure 3 depicts this boundary graphically. 

4 Conclusions 

We have presented several new complexity results for scheduling problems involving distance con- 
straints. This work sharpens the boundary between known polynomial time solvable scheduling 
problems and known W"'P-complete scheduling problems; however, there are still several open prob- 
lems of practical interest. Thel|/3i;pj = I'Jij G {0, _L}|C max problem corresponds to code scheduling 
on a single pipelined processor where some instructions have a fixed latency L and others have zero 
latency (corresponding to full bypass circuitry for some, but not all, instructions). Even though 
bypass circuitry can decrease the complexity of the code scheduling problem, it is not always prac- 
tical to add this extra hardware to the processor design. This is particularly true in Digital Signal 
Processors (DSPs). DSPs typically do not contain bypass logic; thus, code scheduling for these DSPs 
corresponds to solving the l\prec;pj = I'Jij = L (L > 2)|C max scheduling problem. 
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constrained to be scheduled in the time interval between task Xi+i and task Xi + 2- The 
resultant template has 2q non-full intervals, 7i, I2, ■ ■ ■ , I'zq, within which the remaining 
tasks are scheduled. The tasks from the D{ dags are scheduled without idle times in the 
remaining time slots of the intervals. Furthermore, due to the distance constraints and 

lengths of the chains £>(;, s ( ai ) + i) -< -D( 8 >(a,)+2) -< ••• -< A»0,)+«-i)> tlle nrst s ( a i) 
tasks from these dags, Di, 1 < i < 3q, must be scheduled in the first q intervals Ij , 
1 < J < ij , and the last s(a,i) tasks from these dags must be scheduled in the last q 
intervals Ij, q + 1 < j < 2q. 

Consider the first interval 7i and only the first s(ai) tasks in the dags Di. Let 
Si be the set of dags Di that have their first s(ai) tasks, -D( 8) i), Dti^), ■ ■ ■ , Dti tS t ai \\, 
scheduled in 7i . The template dag leaves B time units to schedule other tasks in 7i . 
Assume that ^2 D eS s(a 8 ) = B — c for some c > 0. Consider the interval I q +i. The 
template dag leaves B + 3g — 3 time units to schedule other tasks in I q +i. There are 
at most 3g tasks, Dtij\, s(a;) + 1 < j < s(ai) + q — 1, from the dags Di that may be 
scheduled in this interval. The only additional tasks that may be scheduled in this time 
interval are the final s(ai) tasks of the dags Di whose first s(ai) tasks were scheduled 
in the first interval 7i such that Di 6 Si. \Si\ < 3 by the constraints in the 3-Partition 
Problem, and c is characterized by the following function due to our assumptions on 
B and s(ai), a; 6 A. 



c > < 



Therefore, there are c — 3 + |Si| > idle time slots in interval I q +i. It follows that 
our assumption is incorrect, and ^2 D GS s(a 8 ) = B. An iterative application of this 
argument leads to the identification of sets Sj with ^2 D GS . s(a 8 ) = B for 1 < j < q and 
to the conclusion that we have a 'Yes' instance of 3-Partition. ■ 

We may assume without loss of generality that the optimum solution to l\prec;pj = l',lj t k = 
'|C max does not contain any idle times. Since this problem involves unit execution time tasks, the 
mean flow time to the optimum schedule for l\prec;pj = l',lj t k = '|C max is ^Cj = ^7=^1" i = 
^2j =1 j. This is the smallest mean flow time that any schedule may have. Therefore, finding a 
schedule to l\prec;pj = 1; Ij^ = /| ^Cj with mean flow time of X2?=i 3 finds an optimum schedule 
to l\prec;pj = l',lj t k = /|C max , and l\prec;pj = l',lj t k = A^Z^i ls strongly A'P-complete. This 
result is formalized in the following theorem. 

Theorem 3.4 l\prec;pj = l',lj t k = ^\^2Cj is NV- complete in the strong sense. 

Proof We reduce the known strongly A'P-complete problem l\prec;pj = l',lj t k = '|C max to 
l\prec;pj = l',lj t k = A^2^j- Without loss of generality we assume that the optimum 
schedule, S op t, to l\prec;pj = l',lj t k = '|C max does not contain any idle time slots. The 
mean flow time for S opt is \ Y%=\ 3 = \ J2"=i 3- 

Given an instance of l\prec;pj = l',lj t k = '|C max , we assign a weight of one 
to all tasks. Let y = X2j=ii- The optimum solution to l\prec;pj = l',lj t k = '|C max 
is a solution to l\prec;pj = l',lj t k = ^\^2Cj such that ^] =1 Q < y. Conversely, any 
solution to l\prec;pj = l',lj t k = ^\^2Cj such that X2?=i ^i — V ^ s clearly an optimum 
solution to l\prec;pj = l',lj t k = /|C ma x- The theorem follows. ■ 



21 



Given an instance of the 3-Partition Problem, we construct the following in- 
stance of l\prec;pj = l',lj t k = /|C max - We define the non-zero distance constraint to 
be 

1 = B + 3q-3. 

There are l(Aq + 1) + Aq + 2 tasks arranged in 3q + 1 separate dags. For each a; 6 A 
there is one dag A containing 2s(a*) + g - 1 tasks, £>(;,i), D(i,2), ■■■, D(i,(2s( ai )+q-i)) ■ 
The precedence constraints within D{ are defined as follows. Duj\ -< D(i, s (ai)+i), 1 < 
j < s(ai). D (iJ) -< D (ii+1) , s(ai) + 1 < j < s(a { ) + q - 1. £>( 8> ( ai ) +( ,_i) -< -D(i,j), 
s(a 8 ) + q < j < 2s(a,i) + q - 1. 

One additional dag, X, is created. X contains l(2q + l) + 4g + 2 + 2 Yli=iO ~ ^ ~ 
3(i — 1)) tasks. The central feature of X is a chain of Aq + 2 tasks, Xi,X2, . . . ,X4 g+ 2- 
All other tasks of X are connected to at least one of these chain tasks and most are 
connected to two of the chain tasks. A non-chain task has precedence relations with 
chain tasks only. For simplicity in the definition of the remaining precedence constraints 
we note a non-chain task of X as Yj . We also use the following invariant. If it is stated 
that chain task X{ precedes non-chain task Yj , X{ -< Yj , then it is also true that Yj 
precedes X i+3 , Yj -< X i+3 , if X i+3 exists. 

The remaining precedence constraints in X are defined as follows. Yoj -< X 3 , 
1 < j < I. Xi ~< Yij, 1 < j < I, 1 < i < Aq, and i is even. X; -< Yij, 1 < j < 
I — B — 3(o(i) — 1), 1 < i < 2q, i is odd, and o(i) returns the index of i in the list of 
odd numbers greater than zero, e.g., o(l) = 1, o(3) = 2, o(5) = 3, o(7) = 4. Xi -< Yij, 
1 < j < I — B — 3(q — o(i — 2q)), 2q + 1 < i < Aq, i is odd, and o(i) is defined as above. 

All distance constraints are equal to /. 

The deadline for the schedule is z = l(Aq + 1) + Aq + 2. Note that the processing 
times of all the tasks is equal to z; thus, any schedule that completes before the deadline 
must not have any idle time. It is easy to verify that this reduction requires time 
polynomial in the parameters of the 3-Partition problem. 

Suppose we have a 'Yes' instance of 3-Partition. A schedule of length z is 
constructed as follows. Start the tasks of chain X as soon as possible. A(- 4g _|_ 2 ) finishes 
at time z. The remaining Yij tasks are constrained to be scheduled in the time interval 
between task Xi+i and task A 8 _|_2- The resultant template has 2q non-full intervals, 
I\, I2, ■ ■ ■ , I'zq, within which the remaining tasks must be scheduled. These intervals 
occur between tasks Xi and Xi+i for 1 < i I < Aq and i even. The number of empty time 
slots in each interval J; are characterized by the following function. 

em P ty(Ii) = l [ B B + + %~q-i) l q+ l lVi<2q 

By assumption of a 'Yes' instance of 3-Partition, there exists q disjoint 5-task 
sets, H\, H2, ■ ■ ■ , H q , with processing time of B and comprised of the first s(ai) tasks 
from the dags Di, 1 < i I < 3q. Schedule H^ in interval Ij., 1 < k < q. Consider tasks 
D(ij\, Du t 2), ■■■ , D(i, s (ai)) scheduled in interval Ij (j < q due to how we scheduled 
the sets H k ). The tasks £>(;, s ( ai )+i), -D( 8 >(a,)+2)) ••• , -D(»(a,)+«-i) can be scheduled 
during the next q — 1 intervals. The additional empty time slots above the B needed 
to schedule Hj allow these 'pass-through' tasks to be scheduled in these intervals. By 
scheduling them thus, The tasks D( itS ( a r )+q) , _D( 8> ( ai ) +g+1 ), . . . , _D( i)2s ( a ,) +g _i) can be 
scheduled in the interval Ij+ q - The resulting schedule is feasible with a makespan of z. 

Conversely, suppose that we have a schedule of length z. As before, the tasks 
in chain X must be scheduled as soon as possible, and the remaining Yij tasks are 
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dependent on any node £ E(t + 1) except Tj. This is because at / time units away, there 
is one slot corresponding to each slot at time t + 1. Note that 7 can not be an empty 
time slot. If it were, then we could interchange Tj and TJ and interchange s 8 - 1 and 7. 
The resulting schedule would have a lower mean flow time contradicting our assumption 
on the optimality of the original schedule. 

We may now interchange TJ and TJ and interchange s 8 - 1 and 7. We repeat 
this analysis for s;^, Sj,3, • • • until no interchange is required. Since l(Tj) > /(TJ), an 
interchange is always feasible until s 8 ' a £ E(t + al + 6) for some a and 6 > 1. Then, no 
further interchange is required. Observe that Si yX = Sj y for some x and some y with y > 
x. The interchange will always terminate at some point because s(si yX -i) < s(sj y -i). 
■ 

Theorem 3.1 Hu's algorithm optimally schedules instances of the P\intree; pj = l',lj t k = ^\^2Cj 
problem. 

Proof Follows from Lemma 3.1 and Lemma 3.2. ■ 

3.1.2 P|outtree;pj = l;l j)k = 1| ^ Cj 

Bruno et. al. [BJS80] proved that a critical path based algorithm optimally solves the problem 
P\outtree;pj = l',lj t k = /|C ma x- The critical path algorithm is similar to Hu's algorithm and may 
be simply stated as: at each time step schedule those ready tasks with the largest label. The label 
of task Tj , l(Tj), is defined to be l(Tj) = max{Nj 1 , ... , Nj k } + 1, where Nj 1} ... , Nj k are the 
labels of the immediate successors of task Tj . The label of a task with no immediate successors is 1. 
The proof of optimality relies on the analysis of congestion regions, i.e., regions where more than 
m tasks are ready to execute during a time slot, where m is the number of machines. Bruno et. al. 
were able to prove that tasks that are not scheduled in a congestion region are scheduled as early 
as possible, i.e., given an infinite number of processors, these tasks could not be scheduled any 
earlier. They also proved that congestion regions do not contain any idle time slots, except possibly 
during the last time slot of the region. Finally, they proved that a critical path schedule during 
one congestion region does not induce idle time slots in a critical path schedule during another 
congestion region. Consequently, the proof given in [BJS80] is valid for the following theorem and 
is not repeated here. 

Theorem 3.2 The critical path algorithm optimally schedules instances of the P\outtree;pj = 
1; /jj, = /| ^2C'j problem. 

3.2 l|prec;pj = l;l j)k = l|C max ,ECj 

We now consider the problems in the previous section but with arbitrary precedence constraints, 
namely l\prec;pj = l',lj t k = /|C max and l\prec;pj = l',lj t k = A^Cj. Unfortunately, allowing 
arbitrary precedence constraints makes the problems strongly W'T-'-complete. 

We reduce the 3-Partition problem to l\prec;pj = l',lj t k = /|C max in a manner similar to the 
reductions used for chain structured tasks. Precedence constraints are used to create a specific 
template structure in a similar manner to the way the zero distance constraints and arbitrary 
execution times are used in the reductions involving chain structured tasks. 

Theorem 3.3 l\prec;pj = l',lj t k = /|C max is AfV -complete in the strong sense. 

Proof l\prec;pj = 1; Ijk = /|C max is clearly in AfV. To prove that it is also strongly AfV-h&rd, 
we reduce the 3-Partition Problem to it. 
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3.1 l|tree;pj = l;lj )k = l|C max ,£Cj 

We first consider the unit execution time scheduling problem where all distance constraints are 
equal and the precedence constraints form a tree. When the precedence constraint topology is 
an intree, the makespan problem P\intree;pj = l',lj t k = '|C max is solvable in polynomial time 
by Hu's algorithm [Li77]. When the precedence constraint topology is an outtree, the makespan 
problem P\outtree;pj = 1; Ijk = /|C max is solvable in polynomial time by using a critical path based 
algorithm [BJS80]. 

We show that Li's results [Li77] for the makespan objective function are easily extended to the 
mean flow time problem P\intree;pj = 1; Ijk = l\^2 Cj. We also show that the proof of optimality 
for the makespan problem for outtrees given by Bruno et. al. [BJS80] is valid for the mean flow time 
problem P\outtree;pj = l',lj t k = A^2^j- 

3.1.1 P|intree;pj = l;lj,k = 1| ^2 Cj 

We prove that the problem P\intree;pj = l',lj t k = ^\^2Cj is solvable in polynomial-time by Hu's 
algorithm [Hu61]. Our proof of optimality closely follows that of Li [Li77] for the makespan objective 
function. 

Hu's algorithm may be simply stated as: at each time step schedule those ready tasks with the 
largest label. The label of task 7) , l(Tj), is defined to be l(Tj) = Nj + 1, where Nj is the label of 
the immediate successor of task 7) . The label of a task with no immediate successor is 1. 

We first make an observation on the structure of an optimum schedule. 

Lemma 3.1 In an optimum schedule to P\tree;pj = 1; Ijk = /| ^Cj, no machine is left idle during 
a time slot in which a task is ready to be scheduled. 

Proof By contradiction. Assume that we have an optimum schedule that contains an idle time 
slot, t, during which a task 7) was ready to execute. Since 7) is ready during time slot t 
but scheduled at some time greater than t, we may reschedule 7) in time slot t without 
affecting the feasibility of the schedule and without rescheduling any other tasks. The 
rescheduled completion time of 7) is less than its original value. Thus, we have found 
a schedule with a smaller mean flow time contradicting our assumption of an optimum 
schedule. ■ 

The proof of the following lemma is identical to the one provided by Li [Li77]. It is repeated 
here for completeness. 

Lemma 3.2 At any time t, in generating a schedule for P\tree;pj = l',lj t k = ^\^2Cj, for the 
remaining tree of n nodes to be optimally scheduled, those tasks with the highest labels should be 
scheduled first. 

Proof Let s(j) be the scheduled time of task 7) . E(t) is defined to be the set of tasks scheduled 
at time t. Sji is the immediate successor of task 7) , and Sj^ is the immediate successor 
of task Sji-i. 

By induction. For n = 1, the lemma trivially holds. Assume that the lemma is 
true for n < k — 1. Then, for n = k suppose there exists an optimum schedule where 
there is a ready task at time t,Tj ^ E(t), but l(Tj) > l(Ti) for some 7} £ E{t). In case 
there is more than one such 7], choose the one with the lowest label value. 

By our induction hypothesis 7) £ E(t + 1). If s 8) i £ E(t + 6) for some 6 > /, then 
Tj and 7} may be interchanged in the schedule without affecting feasibility or optimality. 

If s iyl £ E(t + l), then s jyl £ E(t + l+6) for some 6 > 1. Then 3 7 £ E(t + l+l) 
such that 7 is either not dependent on any node £ E(t + 1) or Sji where Sji is not 
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Table 3: Complexity boundary involving multiple machine chain scheduling problems and the 
makespan and mean (weighted) flow time objective functions. 

Theorem 2.10 ([DLY91]) Pm\cham\ £ WjCj is strongly AfV- complete. 

Theorem 2.11 ([DLY91]) Preemption cannot reduce the mean weighted flow time for a set of 
chains. 

Theorem 2.12 ([DLY91]) Pm\chain; pmtn\^2wjC'j is strongly AfV -complete. 

We now show that _Pm|chain;j>j = 1| ^WjCj is strongly W"'P-complete. 

Theorem 2.13 Pm\chain;pj = 1| £wjCj is strongly AfV -complete. 

Proof Consider the problem _Pm|chain;pmtn| £ WjCj. Without loss of generality we assume 
that all tasks in _Pm|chain;pmtn| £ WjCj have integral processing times, p j . We rep- 
resent each task 7) £ T as a chain of pj unit execution time tasks Cj t \ -<■■■-< Cj tP -. 
To preserve the original precedence constraints, we add precedence constraints C'i tPt -< 
Cj t \ V Ti -< Tj in the original problem. 

With each task Cj tP - we associate the weight w j . The weight for all other tasks 
is set to zero. 

This reduction is performed in pseudo-polynomial time since the number of 
tasks in the reduced problem is equal to Eft- 

Solving _Pm|chain;j>j = 1|£wjCj clearly finds a feasible schedule to _Pm|chain;pmtn| 
^2 WjCj. By Theorem 2.11 the optimum schedule to _Pm|chain;j>j = M^WjC'j is 
an optimum schedule to _Pm|chain;pmtn| £ WjCj . Therefore, solving _Pm|chain;j>j = 
1|E w jCj solves _Pm|chain;pmtn| £ WjCj. Thus, _Pm|chain;j>j = M^WjC'j is strongly 
A/'P-complete. ■ 

Table 3 presents the known boundary involving multiple machine unit execution time chain 
scheduling problems and makespan, mean flow time, and weighted mean flow time objective func- 
tions. 

3 Arbitrary Precedence Structured Tasks 

We now examine the complexity of several problems involving tree and prec precedence constraint 
topologies. We show that for the makespan, C max , and mean flow time, £ Cj , objective functions the 
problems involving tree structured precedence constraint topologies are polynomial-time solvable, 
and for the problems involving prec structured precedence constraint topologies are strongly AfV- 
complete; therefore, they are strongly W"'P-complete for all of the objective functions shown in 
Figure 1. 
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Table 2: Complexity boundary involving single machine chain scheduling problems and the makespan 
and mean flow time objective functions. 

2.5 Complexity Boundary Analysis involving Chains 

We have proven the strong W"'P-completeness of several single machine problems involving chain 
structured tasks. By extension, the multiple machine versions of these problems are also strongly 
A/'P-complete. These results lead naturally to the question of where is the boundary between 
polynomial time solvable problems and W"'P-complete problems. 

Table 2 presents the known boundary for the single machine case with respect to the makespan 
and mean flow time objective functions. Polynomial time solvable problems are denoted by a 'p.' 
Strongly W"'P-complete problems are denoted by a '*.' And, problems with an unknown complexity 
are denoted by a '?.' Note that the complexity results obtained in this section are all new minimal 
W''P-complete results. 

A graphical representation of this boundary with respect to allowable task processing times and 
distance constraints is shown in Figure 2. The graph in Part (a) of the figure depicts the boundary 
for the makespan objective function, and the graph in Part (b) of the figure depicts the boundary 
for the mean flow time objective function. 

We now examine the complexity boundary involving multiple machine problems and problems 
that do not involve distance constraints. In order to more fully delineate the boundary we present 
one additional A'P-completeness proof for a parallel machine scheduling problem that does not 
involve distance constraints, namely Pm\chain;pi = M^WjC'j for any fixed m with m > 2. Du, 
Leung, and Young proved that P m\chain; pmtn\ ^WjCj is strongly A'P-complete by showing that 
preemption can not improve the mean weighted flow time to Pm\chain\ ^2 WjCj [DLY91]. This 
result suggests that requiring all processing times to be equal to one time unit, i.e., /3s is set to 
Pj = 1, does not reduce the complexity of the problem. We prove this to be the case after stating 
the main results obtained in [DLY91]. 
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characterized by the following function 



empty(Ii) = < 



1 



J(B 
J(B 



6)- 
6)- 
6)- 



5 
B 
B 



< 3 < Q ~ 1 
< 3 < q - 1 
< j < q ~ 1 



k + j(B + 6) + 1, 1 < k < B, 0<j<q-l 



< j < q ~ 1 
< j < q ~ 1 
< j < q ~ 1 




By assumption of a 'Yes' instance of 3-Partition, there exists q disjoint 3-element 
sets, Hi, H2, ■ ■ ■ , H q , with each element a,j corresponding to chain Cj. Schedule the 
corresponding chains of the elements in Hi during the first 3 + 5 non-full intervals. Note 
that of these 3 + 5 intervals, each of the first 5 intervals has one idle time unit and each 
of the last three intervals has 2 idle time units. The tasks corresponding to Hi consist 
of 5 tasks with pj = 1 and three tasks with pj = 2. The precedence constraints are such 
that the tasks corresponding to Hi may be scheduled in the first 3 + 5 non-full intervals. 
(A non-full interval is an interval that contains a non-zero amount of unused processing 
time after all chains X{ and Y are scheduled.) Similarly, the tasks corresponding to H^ 
may be scheduled in the next 3 + 5 non-full intervals, and so on. The resulting schedule 
is feasible and has a makespan of z. 

Conversely, suppose that we have a schedule of length z. As before, the tasks 
in chains X{, 1 < i ' < L — 1, must be scheduled as soon as possible, and the tasks from 
chain Y must be scheduled as soon as possible within the (6 + B)q + 1 intervals. The 
tasks from the Cj chains are scheduled without idle times in the remaining time slots 
of the schedule. Furthermore, tasks with an execution time of 1 are only scheduled in 
intervals containing a task of Y with an execution time of 1, and tasks with an execution 
time of 2 are only scheduled in intervals that do not contain a task from Y . 

Consider the first 3 + 5 non-full intervals. Each of the first 5 intervals contains 
a unit execution time task in a chain. By assumption on a schedule of length z, intervals 
Ib+2, Ib+4, and Ib+6 each contain a non-Y task with execution time of 2. Since all 
non-Y tasks with an execution time of 2 are the final tasks in the C chains, there must 
be three chains C'i, Cj, and C'k, that have their respective first s(ai), s(aj), and s(at;), 
tasks scheduled in the first 5 intervals. Therefore, chains C'i, Cj, and C'k are completely 
scheduled during the first 3 + 5 non-full intervals. C'i, Cj, and C'k correspond to set Hi 
containing the elements a 8 -, aj , and a^ such that |5i| = s(ai) + s(aj) + s(ak) = 5. An 
iterative application of this argument over each of the q sets of 3 + 5 non-full intervals 
leads to the identification of sets Hj with \Hj\ = 5 for 1 < j < q and to the conclusion 
that we have a 'Yes' instance of 3-Partition. ■ 

The complexity proof for the mean flow time objective function is identical to the complexity 
proof for the makespan objective function. This is because the reduction forces the tasks to have a 
known sum of completion times. There is no variability in the sum of completion times if there is a 
'Yes' solution to the 3-Partition problem. Therefore, by replacing z in the complexity proof for the 
makespan objective function with the sum of the completion times of chains Xi, 1 < i < L — 1, Y, 
and Cj, V aj £ A, the proof remains valid for the mean flow time objective function. 

Theorem 2.9 l\chain;pj £ {1, 2}; Ijk = L (L > 2)| ^Cj is AfV-complete in the strong sense. 
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Theorem 2.8 l\chain;pj £ {l,2};ljj c = L (L > 2)|C max «s AfV-complete in the strong sense. 

Proof This problem is clearly in AfV. To prove that it is strongly Af V-h&rd we reduce the 
3-Partition Problem to it. 

Given an instance of the 3-Partition Problem, we construct an instance of the 
l\chain;pj £ {l,2};ljj c = L (L > 2)|C max problem as follows. There are [(6 + B)q + 
l](L — 2) + 3</(3 + 5) + 1 tasks. For each aj £ A, there is a chain Cj consisting of s(aj ) + 1 
tasks, Cji -< (7^2 -^ ' ' ' -^ Cj>(a )+i; with processing times pj^ = 1, 1 < A; < s(aj), 
and Pj, s ( ai )+i = 2. 

_L additional chains, A";, 1 < i ' < L — 1, and Y, are created. A 8 -, 1 < ? ' < L — 1, 
contains (6 + B)q + 2 tasks each with processing time 1. Y contains (3 + B)q + 1 tasks 
with processing times of 

C 2 i = 

KY)=< 1 J(3 + 5) + l<i< j(3 + 5) + 5,0< j<g-l 

[ 2 j(3 + 5) + B + 1 < i < j(3 + 5) + B + 3, < j < q - 1. 

All distance constraints are equal to a constant L > 2. 

We define z = [(6 + 5)g + 1]_L + L — 1. Note that the sum of the processing 
times of all tasks is equal to z; therefore, any schedule that completes by time z must 
not have any idle time. Figure 6 illustrates how the template chains X{ and the enforcer 
chain Y create a template within which the Cj chains must be scheduled. It is easy to 
verify that this reduction requires time polynomial in the parameters of the 3-Partition 
problem. 



Figure 6: Template formed from the X{ chains and the Y chain in the proof of Theorem 2.8. X^ in 
the figure corresponds to the set of tasks Xi^, 1 < i < L — 1. The shaded regions indicate unused 
processing times after chains X{, 1 < i : < L — 1, and Y have been scheduled. 



Suppose that there exists a 'Yes' solution to the 3-Partition Problem. Schedule 
the tasks of chains X,;, 1 < i < L — 1, as soon as possible. Xi( 6+ g\ q+ 2 finish at times 



[z — L + 1, z\. We have (6 + B)q + 1 intervals, I\, 1^ 



I, 



(6+B)q + l 



each of length 2 



within which the remaining tasks can be scheduled. Chain Y has (3 + B)q + 1 tasks; 
however, due to the processing times of the tasks in X{ and the distance constraints 
there are 3g intervals during which a task from Y cannot be scheduled, e.g., the interval 
between tasks Xi t B+e and Xi t B+r Vi. Therefore, we must schedule the tasks of Y as 
early as possible. After scheduling chain Y the empty time slots in each interval J; are 
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be the first task to complete an integral amount of processing after time t{ in S . Let 
tj > t{ be the time slot in which this unit of C completes. We may move this unit of 
processing time of C to time slot tk where t{ < tk < tj and tk is the earliest time slot 
not already containing a fixed unit of processing time in which this unit of C may be 
scheduled. Fix this unit of C in time slot t^. Note that due to distance constraints tk 
may not be equal to t{. 

All non C tasks that were scheduled in the interval [tk,tj] in S may be 'com- 
pressed' into the interval [tk + l,tj] where the compression does not change the scheduled 
time of fixed task segments. The compression maintains the ordering of the compressed 
tasks in the schedule, and it does not change the completion times of any task in the 
schedule S except possibly decreasing the completion time of task C . Thus, the modified 
schedule is still optimal. 

By iteratively finding a time slot t{ and a task C , we construct the schedule S' 
that contains preemptions only at integral boundaries, and the lemma is proven. ■ 

The complexity results for the makespan scheduling problem containing chains, unit execution 
time tasks, and all distance constraints equal to either zero of / for some / input to the problem 
follow directly from Theorem 2.3 and Lemma 2.1. 

Theorem 2.5 l\chain;pj = l;lj t k G {0,/}|C max is AfV -complete in the strong sense. 

Proof Follows from Theorem 2.3 and Lemma 2.1. ■ 

Likewise, the complexity for the mean weighted flow time objective follows from Theorem 2.4 
and Lemma 2.1. 

Theorem 2.6 l\chain;pj = l;lj t k G {0,/} | ^WjCj is AfV -complete in the strong sense. 

Proof Follows from Theorem 2.4 and Lemma 2.1. ■ 

However, we can do better than this. The complexity for the mean flow time objective function 
follows from the complexity for the makespan when all processing times are equal to one. 

Theorem 2.7 l\chain;pj = l;lj t k G {0,/}| £Cj is AfV -complete in the strong sense. 

Proof We restrict the problem to instances where the optimum schedule S op t contains no 

idle time. The sum of the completion times for S op t is X2iJT* * = Yl7=i * = \ n { n ~\~ 1). 
Solving for the optimum sum of completion times is equivalent to solving for the optimum 
makespan. The theorem then follows from Theorem 2.5. ■ 

2.4 l|chain; Pj £ {l,2};l j>k = L (L > 2)|C max ,£Cj 

We now turn our attention to the case when the values of the distance constraints are no longer 
part of the input to the problem, but instead are a fixed value. We consider the problem when 
all distance constraints are equal to a fixed value L. When L = 0, l|/?i; fa', ^3; Uj = 0|C max ,£Cj 
are solvable in polynomial time. When L = 1, l\fa;fa;fa;lij = l|C max are solvable in polynomial 
time; however, the complexity of many of these problems is unknown for the sum of completion 
time objective function £ Cj . We consider problems for which L > 2 and fa = chain. We further 
restrict the problem to contain only tasks with processing times of either 1 or 2. In other words, 
we examine the complexity for l\chain;pj G {1,2};/^ = L (L > 2)|C max ,£Cj. We prove that 
l\chain;pj G {1,2};/^ = L (L > 2)|C max ,£Cj are strongly W"'P-complete problems. These results 
are stronger than the results from Section 2.1. 
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these intervals. By scheduling them thus, Ctj( q+ iy\ can be scheduled in the interval 
Ii+q. The resulting schedule is feasible with a mean flow time < z. 

Conversely, suppose that we have a schedule with a mean flow time < z. As 
before, the tasks in chain X must be scheduled as soon as possible, and exactly one task 
from chain Y must be scheduled in each interval /;, 1 < i < I 5 + 2q. The tasks from the 
Cj chains are scheduled without idle times in the remaining time slots of the intervals. 
If this is not the case, then there must be at least one task, XJ%, with completion time C'k 
> (I 5 + 2q + 1)5 > X2i=iP^'/(')] contradicting our assumption on the mean flow time 
of the schedule. Furthermore, due to the distance constraints and lengths of the chains 
Cj, the first tasks from these chains, Cfju, 1 < j < 3g, must be scheduled in the first q 
intervals /;, 1 < i < q, and the last tasks from these chains, Ctj( q+ iy\, 1 < j < 3g, must 
be scheduled in the last q intervals /;, q + 1 < i ' < 2q. 

Consider the first interval 7i and only the first tasks in the chains Cj . Let Si be 
the set of C(j,i) tasks that are started and finished in I-\_. Task Y\ requires 3g — 2 time units 
in Ii leaving 5 time units to schedule other tasks. Assume that £ c s pj = B — c 
for some c > 0. Consider the interval I q +i. Y q+ i has an execution time of 1. There 
are at most 3g unit execution time tasks from the chains Cj that may be scheduled in 
this interval. The only additional tasks that may be scheduled in this time interval are 
the final tasks of the chains Cj whose first task was scheduled in the first interval 7i 
such that C(j,i) G Si . |Si| < 3 by the constraints in the 3-Partition Problem, and c is 
characterized by the following function due to our assumptions on 5 and s(aj), a,j G A. 



c> < 



Therefore, there are c — 3 + |Si| > idle time slots in interval I q +i. It follows that 
our assumption is incorrect, and ^2 C s pj = B. An iterative application of this 
argument leads to the identification of sets Sj with ^2 C s pj = B for 1 < i I < q and 
to the conclusion that we have a 'Yes' instance of 3-Partition. ■ 

2.3 l|chain; Pj = l;l j>k £ {0, l}|C max , £ Cj 

The strong A'P-completeness results for the preemptive version of the problem suggests that if the 
problem is restricted to have integral processing times and distance constraints, then preemptions 
may occur at integral boundaries only without affecting the complexity of the problem. The following 
lemma formalizes this observation. 

Lemma 2.1 If all input parameters are integral valued, then there exists an optimal solution to 
l|/?i; pmtn; /3s; (3$ \ C ma x,X2Cj such that all preemptions occur at integral time boundaries. 

Proof We will prove this lemma by constructing an optimal solution S' containing preemptions 
only at integral time boundaries from an optimal solution S that may have preemptions 
at non-integral time boundaries. The solution S' will be constructed by iteratively 
'fixing' a unit of execution time from a single task into a time slot. Once a unit of 
execution time is 'fixed' at a particular time slot it will remain at that time slot for the 
remainder of the iterations. 

Let S be an optimal solution to a problem in l|/?i; pmtn; /3s; (3$ \ C max , ^2Cj. 
Let ti be the first time slot in S that has more than one task scheduled in it. Fix the 
schedule through time slot £;_i. Considering only non-fixed task segments, let task C 
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For each aj £ A there is one chain Cj containing q + l tasks, Cfju, C(j,2); • • • ; C(j,(a+ 1 ))' 
The processing times for C(j,i) and Cyr^+i)) are s(aj). All other processing times for 
the tasks in Cj are equal to 1. 

Two additional chains, X and Y, are created. X contains I 5 + 2q + 1 tasks each 
with processing time /. Y contains I 5 + 2q tasks with processing times of 



3q- 


- 3i + 1 


for 1 < % ' < q 


3i- 


-3g-2 


for g + 1 < i < 2q 




/ 


for 2q + 1 < i < I 5 + 2q 



p(Yi) = 

All distance constraints are equal. We define the non-zero distance constraint 
to be 

1 = B + 3q-2. 

The target mean flow time for the schedule is 

l 5 +2q + l l 5 +2q 2q 

z = J2 ['(2i - 1)] + J2 [2/i] + ^[2h7(0], 

8 = 1 i = 2q + l 8 = 1 

where f(i) is defined as 

tC\-l 4 + [(3? - 2) - (3g - 3i +1)] 1 < i < q 
I[%} \ 4 + [(3g - 2) - (3i - 3g - 2)] g + 1 < i < 2q 

It is easy to verify that this reduction requires time polynomial in the parameters of the 
3-Partition problem. 

Suppose we have a 'Yes' instance of 3-Partition. A schedule with mean flow 
time < z is constructed as follows. Start the tasks of chain X as soon as possible. Thus, 

l 5 +2q + l l 5 +2q + l 

E C X ,= ^ [/(2i-l)]. 

8=1 8=1 

We have I 5 + 2q intervals, I\, Lj, ■ ■ ■ , Ii$+2q, each of length / within which the remaining 
tasks must be scheduled. Chain Y has I 5 + 2q tasks; therefore, one task of chain Y must 
be executed during each interval. Furthermore, the distance constraints require that no 
more than one task of Y may be executed during any one interval. Thus, schedule Y 
in interval /;. It is easily seen that if the first 2q tasks of A and Y are not scheduled as 
above, then the schedule will have a mean flow time > z. After scheduling chain Y, the 
empty time slots in each interval J; are characterized by the following function. 



B + 3(i - 


- 1) 1 < i < q 


3 + 3(2g- 


-i) q + 1 < i < 2q 




2q + 1 < i < I 5 + 2q 



empty(Ii) 



By assumption of a 'Yes' instance of 3-Partition, there exists q disjoint 3-task 
sets, Hi, H2, ■ ■ ■ , H q , with processing time of B and comprised of the first task from the 
chains Cj, 1 < j < 3g. Schedule H^ in interval Ij., 1 < k < q. Consider task C(j,i) 
scheduled in interval I{ (i < q due to how we scheduled the sets Hk). The tasks Cfj t 2), 
C(j,3), • • • , Ctj A \ can be scheduled during the next q — 1 intervals. The additional 
empty time slots left by chain Y allow these 'pass-through' tasks to be scheduled in 
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the chains Cj, 1 < j < 3g. Schedule _ff^ in interval 1^,1 < A; < q. Consider task 
Cfju scheduled in interval J; (i < q due to how we scheduled the sets Hk). The tasks 
Cfj t 2), C(j,3); • • • , C(j,g) can be scheduled during the next </ — 1 intervals. The additional 
empty time slots left by chain Y allow these 'pass-through' tasks to be scheduled in these 
intervals. By scheduling them thus, Ctj( q+ iy\ can be scheduled in the interval Ii+ q . The 
resulting schedule is feasible with a makespan of z. 

Conversely, suppose that we have a schedule of length z. As before, the tasks 
in chain X must be scheduled as soon as possible, and exactly one task from chain Y 
must be scheduled in each interval /;, 1 < i ' < 2q. The tasks from the Cj chains are 
scheduled without idle times in the remaining time slots of the intervals. Furthermore, 
due to the distance constraints and lengths of the chains Cj , the first tasks from these 
chains, Cfju, 1 < j < 3g, must be scheduled in the first q intervals /;, 1 < i ' < q, and 
the last tasks from these chains, Ctj( q+ iy\, 1 < j < 3g, must be scheduled in the last q 
intervals /;, q + 1 < i ' < 2q. 

Consider the first interval 7i and only the first tasks in the chains Cj . Let Si be 
the set of C(j,i) tasks that are started and finished in I-\_. Task Y\ requires 3g — 2 time units 
in Ii leaving B time units to schedule other tasks. Assume that ^2 C s pj = B — c 
for some c > 0. Consider the interval I q +i. Y q+ i has an execution time of 1. There 
are at most 3g unit execution time tasks from the chains Cj that may be scheduled in 
this interval. The only additional tasks that may be scheduled in this time interval are 
the final tasks of the chains Cj whose first task was scheduled in the first interval 7i 
such that C(j,i) G Si . |Si| < 3 by the constraints in the 3-Partition Problem, and c is 
characterized by the following function due to our assumptions on B and s(aj), a,j G A. 



c > < 



Therefore, there are c — 3 + |Si| > idle time slots in interval I q +i. It follows that 
our assumption is incorrect, and ^2 C s pj = B. An iterative application of this 
argument leads to the identification of sets Si with ^2 C s pj=B for 1 < i I < q and 
to the conclusion that we have a 'Yes' instance of 3-Partition. ■ 

As in the previous section, the proof for l\chain;pmnt;ljk = l\^2Cj follows the proof for 
l\chain;pmnt;ljk = /|C max . This is achieved by making the template chain X and the enforcer 
chain Y much longer than the chains corresponding to the elements of A. The increased lengths 
of X and Y ensures that they are scheduled as soon as possible except possibly at the end. The 
chains corresponding to the elements of A must then be scheduled without idle times in the first 2q 
intervals caused by X if there is a 'Yes' instance of 3-Partition. Scheduling a task from these chains 
after all tasks in X (or most tasks in X) causes the mean flow time of the resultant schedule to be 
greater than the target value z. 

Theorem 2.4 l\chain;pmnt;ljk = l\^2Cj is NV- complete in the strong sense. 

Proof l\chain;pmnt;ljk = l\^2Cj is clearly in AfV . To prove that it is also strongly AfV-h&rd, 
we reduce the 3-Partition Problem to it. 

Without loss of generality we assume that 5^3 and that s(aj) ~^> 3, V aj G A. 
If this is not the case, then we may scale B and s(aj), V aj G A, by a constant without 
affecting the 3-Partition Problem. Given an instance of the 3-Partition Problem, we 
construct the following instance of l\chain; pmnt; Ijk = /| ^2Cj. There are 3</ + 2 chains. 

10 



B 


if |Si| = 


5/2+1 


if |5i| = l 


2 


if |5i| = 2 


3 


if |5i| = 3 



X, 



p ,.,= s < a p 



p ,= ] 



P ,3, = 1 



P 0,4,= l 



P =1 
09-1) 



P o,) =1 



P =s(a.) 

O.q+0 v v 



• • • x 



■2,-3 ^q-3 



Figure 5: Template formed from the X chain and the Y chain in the proof of Theorem 2.3. This 
template limits the scheduling structure of the Cj chains. 



Two additional chains, X and Y , are created. X contains 2q + 1 tasks each with 
processing time / (defined below). Y contains 2q tasks with processing times of 



p(Yi) 



3q — 3i + 1 for 1 < % ' < q 

3i - 3q - 2 for q + 1 < i < 2q 



to be 



All distance constraints are equal. We define the non-zero distance constraint 

1 = B + 3q-2. 



The deadline for the schedule is z = Aql + /. Note that the processing times of 
all the tasks is equal to Aql + /; thus, any schedule that completes before the deadline 
must not have any idle time. Figure 5 illustrates how the template chain X and the 
enforcer chain Y create a template within which the Cj chains must be scheduled. It 
is easy to verify that this reduction requires time polynomial in the parameters of the 
3-Partition problem. 

Suppose we have a 'Yes' instance of 3-Partition. A schedule of length z is 
constructed as follows. Start the tasks of chain X as soon as possible. Xt2q+i) finishes at 
time z. We have 2q intervals, I\, I2, ■ ■ ■ , I'zq, each of length / within which the remaining 
tasks must be scheduled. Chain Y has 2q tasks; therefore, one task of chain Y must 
be executed during each interval. Furthermore, the distance constraints require that no 
more than one task of Y may be executed during any one interval. Thus, schedule Y{ 
in interval I{. After scheduling chain Y , the empty time slots in each interval I{ are 
characterized by the following function. 



empty(Ii) 



B + 3(i - 1) 1 < i < q 

B + 3(2g -i) q + l<i<2q 



By assumption of a 'Yes' instance of 3-Partition, there exists q disjoint 3-task 
sets, Hi,H'2, ■ ■ -,H q , with processing time of B and comprised of the first task from 



Then, 



ELiQ > Ef=iQ 



> Ej=i[i3?5 + (i - 1)5] + £f =?+1 [J3<zB + (i - 1)5 + 1] 
= z + (M-g)-El = i[3(3g5 + 5)i 

> z + (M - q) - l(3qB + B)q(q + I) 
= z + (M-q)-\(M-q) 



3 

4 

> 2, 



which is a contradiction. Thus, the tasks of the chain X must be scheduled as early as 
possible. 

In order for the mean flow time of the schedule to be at most z the tasks 7), 
1 < j < 3g, must be scheduled before task X q+ \. Since there are exactly 5 time units 
between each of the tasks in X and preemption is not allowed, each set of tasks Si 
scheduled between tasks X 8 _i and T{, 2 < i < q + 1, must have exactly B units of 
execution time. Furthermore, since 5/4 < pj < 5/2 VI) there must be at least three 
tasks in each set Si. The fact that there are exactly q sets Si requires that there be 
exactly three tasks in each set Si. Thus, there exists a partition into q 3-element subsets 
Si, such that E?ps Pj = E?ps s ( a j) = 5 (1 < j < q), and the schedule is witness to 
such a partition. Thus, we have a 'Yes' instance of 3-Partition. ■ 

2.2 l|chain;pmnt;l j)k = l|C max ,ECj 

A slightly 'easier' version of the above problems allows preemption in a feasible schedule. The 
A'P-completeness proofs for the non-preemptive problems do not hold when preemption is allowed. 
Therefore, we must reexamine their complexity in light of preemption. 

Balas et. al.[BLV95] were able to prove that when preemption is allowed and the distance con- 
straints are restricted to a set of two values that are inputs to the problem, the problem with the 
makespan objective function is strongly A'P-complete. We improve upon this result by showing 
that preemption does not improve the complexity of the problem when all distance constraints are 
equal to the same value, i.e., l\chain; pmnt; Ijt, = /|C max , ^2C'j are strongly A'P-complete. 

The A'P-completeness proofs are very similar to those for the non-preemptive problems. The 
main differences are that each element of A corresponds to a chain of tasks instead of a single 
task and the template chains form 2q intervals within which these tasks may be scheduled. The 
key idea is that the schedule from the first q intervals (corresponding roughly to the schedule in 
the non-preemptive proofs) constrains the possible schedules in the second q intervals (the mirror 
schedule). 

Theorem 2.3 l\chain;pmnt;ljk = /|C max is NV- complete in the strong sense. 

Proof l\chain; pmnt; Ijt, = /|C max is clearly in AfV. To prove that it is also strongly AfV-h&rd, 
we reduce the 3-Partition Problem to it. 

Without loss of generality we assume that 5^3 and that s(aj) ^ 3, V aj £ A. 
If this is not the case, then we may scale 5 and s(aj), V aj £ A, by a constant without 
affecting the 3-Partition Problem. This constraint is required to ensure the validity of 
the transformation. 

Given an instance of the 3-Partition Problem, we construct the following in- 
stance of l\chain; pmnt; ljj~ = /|C max . There are 3q 2 + Iq + 1 tasks arranged in 3q + 2 
chains. For each aj £ A there is one chain Cj containing q + 1 tasks, Cfju, C(j,2); • • • , 
Cfj( q+ iy\. The processing times for C(j,i) and Cfj( q+ iy\ are s(aj). All other processing 
times for the tasks in Cj are equal to 1. 



q intervals each containing 5 time units, each of the q sets Si may be scheduled within 
one interval for a total schedule time of 2qB + B. 

Suppose that a schedule of length z exists. The chain X requires a minimum 
time of 2qB + B to complete. Since there are exactly B time units between each of the 
tasks in X in a schedule that completes in time 2qB + B and preemption is not allowed, 
each set of tasks Si scheduled between tasks X 8 _i and X{ must have exactly B units of 
execution time. Furthermore, since 5/4 < pj < 5/2 VI) there must be at least three 
tasks in each set Si. The fact that there are exactly q sets Si requires that there be 
exactly three tasks in each set Si. Thus, there exists a partition into q 3-element subsets 
Si, such that Etp,s Pj = Etp,s s ( a i) = B (1 — ' — ?)> anc ^ the schedule is a witness 
to such a partition. Thus, we have a 'Yes' instance of 3-Partition. ■ 

The reduction for the mean flow time objective function is essentially identical to the reduction 
for the makespan objective function. The main difference is that the template chain X has a large 
number of extra tasks in it. Combined with the appropriate cost z, these extra tasks force the tasks 
in chain X to be scheduled as soon as possible. A large number of extra tasks are required in X 
since the exact finish times of the tasks corresponding to the elements of A are not known. The sum 
of the finish times of the tasks corresponding to the elements in A may only be bounded. Thus, 
without the additional tasks in chain X, it is possible to have a solution with mean flow time at 
most z when the corresponding 3-Partition problem does not contain a 'Yes' solution. 

Theorem 2.2 ([BK98]) l\chain;ljk = l\^2C'j is MV- complete in the strong sense. 

Proof This problem is clearly in AfV. To prove that it is strongly Af V-h&rd we reduce the 
3-Partition Problem to it. 

Given an instance of the 3-Partition Problem, we construct an instance of 
l\chain;ljk = l\^2C'j as follows. There are n = 3q + M tasks where M = 2(3</5 + 
B)q(q + 1) + q. For each aj £ A, there is a task 7) with processing time pj = s(a,j). For 
each i £ {1, . . . , M}, there is a task A) with processing time pi = 3</5. The tasks Xi 
form a chain X = X\ -< X^ -< ■ ■ ■ -< 1m with all distance constraints equal to 5. The 
tasks Tj have no precedence constraints associated with them. We define 

M q 
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Suppose that there exists a 'Yes' solution to the 3-Partition Problem. Since 
there exists a 'Yes' solution to the 3-Partition Problem, there exist q 3-element subsets 
Si such that ^2 TeS Pj =5. Since the chain X can be scheduled such that there exists 
M — 1 'holes' each containing 5 time units, the tasks in X are scheduled as soon as 
possible. Thus, each of the q sets Si may be scheduled within one hole. We schedule the 
sets Si in the first q holes. The resulting mean flow time of the schedule satisfies 

En ^ \-^.M ^ . \-^3q r ^ 
j=l °J - l^j = \ °J + 2^ = 1 ^M+i 

< EL $qBi + B(i - 1)] + ELi [3i(3g5 + &)] 



Suppose that a schedule with mean flow time at most z exists. The first q tasks 
of chain X must be scheduled as early as possible. Assume that this is not the case. 



A/'P-completeness proofs. The 3-Partition problem is stated as follows. 

Given a set of 3g elements A = {a\, 0,2, ■ ■ ■ , a,3q\, a bound 5, and a size s(a,j) for each 
dj £ A such that 5/4 < s(aj) < 5/2 and ^2 a ,. A s(a,j) = qB, can A be partitioned into 
q disjoint sets A\, A2, . . . , A q such that for 1 < k < q, ^ a . £i s { a j) = B? 

For each scheduling problem II in which this reduction is performed we show that for every 
instance of 3-Partition we can compute an instance of II and a value z in polynomial or pseudo- 
polynomial time such that the instance of 3-Partition has a 'Yes' solution if and only if the instance 
of II has a schedule of length at most z (for the makespan objective) or the sum of the completion 
times is at most z (for the mean flow time objective). The strong W"'P-completeness results follow 
from the fact that 3-Partition is W"'P-complete in the strong sense [GJ79]. 

2.1 l|chain;l j)k = l|C max ,£Cj 

The most basic chain structured tasks scheduling problems involving distance constraints are l\chain; Ij^ 
/|C max ,^Cj. The complexity of these problems was first examined by Balas et. al.[BLV95] and 
Brucker and Knust[BK98] respectively. Balas et. al. were able to prove the strong W"'P-completeness 
of l\chain;lj t k = /|C max with a simple reduction from 3-Partition. Brucker and Knust extended 
this proof in a straight forward manner to handle the mean flow time objective function. The 
problems examined in the remainder of this section are restricted versions of these two problems 
(in the sense that they contain additional f3 constraints compared with these problems), and their 
respective W"'P-completeness proofs are very similar in structure to the simple W"'P-completeness 
proofs of l\chain;lj t k = /|C max ,^Cj. For this reason, we repeat the W"'P-completeness proofs for 
l\chain;l jtk = /|C max , J2 Q • 

Theorem 2.1 ([BLV95]) l\chain;lj y k = '|C max is NV- complete in the strong sense. 

Proof This problem is clearly in AfV. To prove that it is strongly Af V-h&rd we reduce the 
3-Partition Problem to it. 

Given an instance of the 3-Partition Problem, we construct an instance of the 
l\chain; lj t k = '|C max problem as follows. There are Aq + 1 tasks. For each a,j £ A, there 
is a task 7) with processing time pj = s(aj). For each i £ {0, . . . , q}, there is a task X{ 
with processing time pi = 5. The tasks X{ form a chain X = Xo -< X\ -< ■ ■ ■ -< X q 
with all distance constraints equal to 5. The tasks 7) have no precedence constraints 
associated with them. We define z = 2qB + 5. Note that the precedence constraints 
form a single chain and that the processing times of all the tasks is equal to z. Also, 
note that the chain X requires z time units to complete due to the distance constraints. 
Thus, any schedule that completes before the deadline, z, must not have any idle time, 
and the tasks in X must be scheduled as soon as possible. The template formed by X 
is shown in Figure 4. It is easily verified that this reduction requires time polynomial in 
the parameters of the 3-Partition Problem. 

B 3B 5 B 7B 2qB-5B 2qB-3B 2qB-B 2qB+B 

2B 4B 6B 2qB-6B 2qB-4B 2qB-2B 2qB 



x„ ,, X X, • • ■ x q . 3 



Figure 4: Template formed from the X chain in the proof of Theorem 2.1. 

Suppose there exists a 'Yes' solution to the 3-Partition Problem. Since there 
exists a 'Yes' solution to the 3-Partition Problem, there exist q 3-element subsets Si 
such that ^2j eS Pj = 5. Since the chain X can be scheduled in time 2qB + 5 leaving 
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Figure 3: Graphical representation of the known complexity boundary for unit execution time 
scheduling problems. Part (a) depicts the boundary for the makespan objective function. Part (b) 
depicts the boundary for the mean flow time objective functions. 
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Figure 2: Graphical representation of the known boundary for single machine scheduling problems 
involving chain precedence constraints. Part (a) depicts the boundary for the makespan objective 
function. Part (b) depicts the boundary for the mean flow time objective function. Nodes cor- 
responding to problems with new complexity bounds presented in this paper are boxed. Nodes 
corresponding to problems whose complexity was previously known are doubly circled. We use the 
representation given by [GJ79]. Problems are represented by circles, filled-in if known to be AfV- 
complete, empty if known to be in V , and dotted if their complexity is unknown. An arrow from 
III to II2 signifies that III is a subproblem of II2. 
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Table 1: Complexity results presented in this paper. 
A'P-complete. 



A '*' indicates the problem to be strongly 



A'P-complete in the strong sense 2 for both of these objective functions; therefore, the problems 
are A'P-complete in the strong sense for all of the standard objective functions shown in Figure 1. 
Table 1 summarizes our complexity results. 

These complexity results narrow the boundary between known polynomial time solvable problems 
and known A'P-complete problems. Figure 2 graphically illustrates the boundary with regard to 
possible task processing times and distance constraints on a single machine. Figure 3 graphically 
illustrates the boundary with regard to possible constraint topologies and distance constraints on a 
single machine. Each circle represents a problem. A filled-in circle represents a known A'P-complete 
problem. An empty circle represents a known polynomial-time solvable problem, and a dotted circle 
represents an unknown complexity for the problem. Problems with new complexity results obtained 
in this paper are marked with a square. Problems whose complexity was previously known are 
marked with a circle. 

The remainder of this paper is organized as follows. Section 2 presents the A'P-completeness 
proofs for several related problems with f3\ = chain and discusses the boundary between AfV- 
complete problems and polynomial time solvable problems with respect to them. Section 3 presents 
polynomial time scheduling algorithms for a simple problem involving f3\ = intree and f3\ = outtree 
along with the A'P-completeness results for several related problems with f3\ = prec. The boundary 
between A'P-complete problems and polynomial time solvable problems with respect to these prob- 
lems is also discussed. Finally, Section 4 summarizes the results of this work and discusses some 
related open problems. 

2 Chain Structured Tasks 



In this section we examine the complexity of several problems involving chain structured tasks, 
i.e., problems in which the precedence constraints form sets of chains. We show that for the 
makespan, C max , and the mean flow time, ^Cj, objective functions these problems are strongly 
A'P-complete; therefore, they are strongly A'P-complete for all of the objective functions shown in 
Figure 1. 

The 3-Partition problem is reduced to all of the various scheduling problems in their respective 



A problem is A^P-complete in the strong sense if it cannot be solved by a pseudo-polynomial time algorithm. Thus, 
a strongly A^P-complete problem cannot be solved in pseudo-polynomial time even if its parameters are bounded by 
a polynomial. 
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Figure 1: Complexity hierarchy for scheduling problems that differ only in their objective functions. 
An arrow from 71 to 72 indicates that a \/3 I71 ex a \/3 I72. 



We use the « |/5|t notation introduced in [GLLR79] to specify scheduling problems, a denotes the 
type of scheduling problem and the number of machines to be used. In the paper we only consider 
a G {1, P, Pm}. The single machine environment, a = 1, is the simplest machine environment 
that is a special case of all other more complicated machine environments, a = P indicates that the 
scheduling will be performed on an arbitrary number (given as input to the problem) of identical 
parallel machines, and a = Pm indicates that the scheduling will be performed on a fixed number 
m of identical parallel machines. /? denotes the constraints on the task characteristics and on a 
feasible schedule. We specify (3 using five parameters, /3\, fa, /?3, /?4, and /3s 1 . j3\ G {o, chain, 
intree, outtree, tree, prec} denotes the precedence constraint topology. /?2 G {o , pmtn} denotes the 
allowability of preemptions in a feasible schedule. /3s G {o, pj = 1, pj G Pj} denotes restrictions 
on the task processing times. /?4 G {o , rj} denotes the presence of nonzero release times, and 
/?5 G {o , lij = L = O(l), lij = /, lij G {0,/}} denotes restrictions on the distance constraints. 
lij = L indicates that all distance constraints are equal in value, and the value is a fixed constant. 
Whereas, lij = / indicates that all distance constraints are equal in value, but the value is given as 
an input to the problem, lij G {0, /} indicates that the distance constraint values may only be one 
of the values in the given set, i.e., or /. 7 denotes the objective function to be minimized. We only 
consider problems with 7 G {C max , E^jl- 

A complexity hierarchy describing the relationships between scheduling problems that differ only 
in their objective functions is shown in Figure 1. This hierarchy indicates that a problem with the 
objective function at the head of an arc reduces to a problem with the objective function at the tail 
of the arc, e.g., a \/3 |C max reduces to a \/3 \L max . Thus, problems with an objective function of C max 
and ^2 Cj are the simplest with respect to their objective functions, and determining that they are 
A'P-complete proves that the corresponding problem is A'P-complete for all objective functions in 
the complexity hierarchy. 

We examine the complexity of several single machine distance-constrained scheduling problems 
for both the C max and the ^2 Cj objective functions. We prove that most of these problems are 



The absence of constraints of type f3 t 
in the a|/5|7 problem statement. 



1 < « < 5, is denoted by a o. For clarity, these symbols are not included 



Complexity Results for Single Machine Distance Constrained 

Scheduling Problems 

MIT-LCS-TM-587 
Daniel W. Engels David Karger Srinivas Devadas 

dragonOcaa . lcs . mit . edu kargerOtheory . lcs . mit . edu devadasOcaa . lcs . mit . edu 

Laboratory for Computer Science 

Massachusetts Institute of Technology 

Cambridge, MA 02139 

November 1998 



Abstract 

Scheduling problems that involve timing constraints between tasks occur often in machine 
shop scheduling (e.g., job shop scheduling problems) and code scheduling during software com- 
pilation for pipelined processors (e.g., multiprocessor sequencing and scheduling problems). The 
timing constraints often take the form of start-start and finish-start time lags. We consider the 
case of non-negative finish-start time lags lij, i.e., distance constraints, that require a minimum 
time from the completion of task T t to the start of task Tj. New complexity results are derived 
for several single machine scheduling problems for both the makespan, C max , and mean flow 
time, ^2 Cj , objective functions. 

Keywords: scheduling, distance constraints, time lags, complexity theory 

1 Introduction 

We examine the complexity of several single machine scheduling problems in which distance con- 
straints (non-negative finish-start time lags) are associated with the precedence constraints. Distance 
constraint lij requires that there exist a minimum of lij time units from the completion time C; of 
task Ti to the start time Sj of task Tj, i.e., C; + hj < Sj. Distance constraints are a generalization 
of the classical precedence constrained scheduling problems where lij = 0. 

Distance constraints often arise in practice. For example, scheduling software on a pipelined 
functional unit that does not contain bypass circuitry requires that all instructions complete the 
last stage of the pipeline before their results are available to any other instruction. Therefore, if 
the pipeline has k stages, then all instructions that require the result from a previous instruction a 
must wait at least k cycles after instruction a enters the pipelined functional unit before entering 
the pipelined functional unit. This software scheduling problem may be made more difficult if some, 
but not full, bypass circuitry exists or different instructions require different numbers of pipeline 
stages to complete. Thus, the distance constraints may not all be equal. 

Recent work [BHH97] has shown that scheduling distance constrained tasks on a single machine 
is often at least as hard as scheduling the same tasks without distance constraints on multiple 
machines. Although this relationship is not always true, it indicates that the addition of distance 
constraints can drastically increase the complexity of a single machine scheduling problem. 



